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Abstract 

We consider the minimum-weight path between any pair of nodes of 
the n-vertex complete graph in which the weights of the edges are i.i.d. 
exponentially distributed random variables. We show that the longest of 
these minimum-weight paths has about a* logn edges where a* ~ 3.5911 
is the unique solution of the equation a log a — a = 1. This answers a 
question posed by Janson [8]. 



1 Introduction 

We consider the complete graph if„ on n vertices [n] := {1, . . . , n}, augmented 
with independent exponential mean n (or Exp(n) for short) edge weights {Xg : 
e e E{K„)}. For any subgraph H = {V{H),E{H)) of we write \H\ for 
\E{H)\ and let 

w{H)= J2 ^- 

For i,j e [n] we let be the minimum- weight path from i to j and write 
Wij = w{Pij) (adopting the convention = 0, wa = 0). For any fixed vertex i, 
Uj^iPij is a tree, the shortest path tree SFT^ rooted at i. For t >0 and fc € [n], 
we let SPTi(t) be the subtree of SPT^ induced by nodes j with Wij < t. 

Let tl, be the first time for which SPT,;(t) contains at least k + 1 vertices, and 
write ig = 0. Due to the memoryless property of the exponential distribution, for 
each fee [n], the location of attachment of the vertex added at time tj. is uniform 
among the vertices of SPTj;(t5,_-^); in other words, SPTi(i^) is distributed like 
a random recursive tree with k vertices. Random recursive trees have been 
well-studied [13]; in particular, it is known that the depth (number of edges on 
the path from a uniformly random node to the root) in a recursive tree with 
n vertices is asymptotic to log n in probability, sec Devroye [3] , and the height 
(greatest number of edges on any path starting from the root) is asymptotic to 
e log n in probability, see Devroye [3] and Pittel [9] . 
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A variety of authors have studied the weighted structure of a single tree SPTi 
(or SPTi for any i), and of the family of shortest paths {Pij : i,j G [n]}. Van 
der Hofstad, Hooghicmstra, and van Mieghem [14, 15] prove that, letting W be 
the total weight of SPTi, 



iw-m) 



■AA(0,4C(3)) 



in distribution. Here, ( denotes the Riemann zeta function C(.s) := X]fc>i ^ 
Janson [8] proved the beautiful result that 



Wi2 

logn 



1, 



maxi<j<„ Wij 
logri 



and 



niaxi<ij<„ Wj^ 
logn 



in probability, and provided more detailed information about the distribution of 
Wi2 and of maxi<j<„ Wij. (Janson's results in fact hold for some edge weight 
distributions other than exponential, see Section 5.) On the other hand, the un- 
weighted structure of shortest path trees has been studied by Hooghicmstra and 
Van Mieghem [7] who derived the limiting distribution of the pair (14^12, 1-P12I) 
suitably rescaled. 

It is immediate from the aforementioned results for the depth of nodes and 
the height in random recursive trees that 



logn 



1 



and 



maxi<j<„ |Pij| 

log 77, 



in probability. Furthermore, as noted by Janson [8], the tail bounds for the 
height of random recursive trees established by Dcvroyc [3] imply that 



lim sup 



maxi<ij<„ \Pij\ 
logn 



< a* in probability, 



(1) 



where a* « 3.5911 is the unique solution of a log a — a = 1. Janson [8] asked 
whether there is a constant c € [e,a*] such that maxi<ij<„ \Pij\/ logn — *■ c in 
probability. It is the purpose of this note to answer Janson's question in the 
affirmative: 

Theorem 1. For any function uj(n) tending to +00 with n, for some constant 
L > 0, and for all S G (0, 1) , for all n sufficiently large, with probability greater 
than 1 — S, 

a* logn — L loglogn < max \Pij\ < a* logn + uj{n) . 

l<z,j<n 

Furthermore, 



a* log n~ L log log n < E 



max I 

l<z,_7<n 



< a* logn + L. 
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Of course, it follows immediately from the first part of Theorem 1 that 
maxi<ij<„ \Pij\/\ogn a* in probability. The easy half of the result is the 
upper bound as (1) and other results from Devroye [3] will make proving the 
upper bounds of the theorem a routine exercise. The nontrivial part of the result 
is the lower bound which we prove using the second moment method, applied to 
a suitably defined set of shortest paths with special properties that make them 
amenable to analysis. Before we get going, a brief remark is in order: we have to 
work with exponential edge weights in order to exploit the memoryless property 
of the exponential distribution. In the conclusion (Section 5) we explain how 
to extend our results to a range of other edge weights that have a finite and 
positive density at 0, including uniform edge weights. 



2 A useful lemma and the upper bound 

For i g [n] and fc € ["■—!] let = — i^. denote the k-th interarrival time. 
By the memoryless property of the exponential distribution, {r^, 1 < A: < n} are 
independent. Furthermore, is the minimum of k(ri — k) independent expo- 
nential random variables with mean n, hence has an exponential distribution 
with rate parameter k{n — k)/n. Using this fact, the following lemma provides 
upper bounds on the size of SPT; (t) . 

Lemma 2. For all t > and integers m € [n — 1] with m > e*, 

P{|SPT,(t)| >m} <3 - VWe*-e""/'='. 
Proof. Observe that, for any c > 0, 



P{|SPT,(i)| >m}^p\j2rl<t \ = p\YI cxp (-cr^) > e" 



. fc=i ) kfc=i 



By Markov's inequality and the independence of the interarrival times r^, . . . , t^_i, 
we have 

P {|SPT,(t)| > m} < n E [cxp (-cr^)] < J] fc^^ ^ ' 

k=l k=l 

where F is the Gamma function. Using the strengthening of Stirling's formula 
F(c+ 1) < \/2'Kc{c/eye^^^'^ (see, e.g., [10]), and setting c = m/e*, we obtain 

P {|SPT,(t)| >m}< 72^6^/12 . ^ . < 3 . ^ . 

as claimed. □ 

Wc will also use the following result from Devroye [3], providing an upper 
bound on the heights of random recursive trees. In the following, h{T) denotes 
the height of a rooted tree T. 
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Theorem 3 ([3], Theorem 9). Let T,n be a random recursive tree with m > 2 
nodes. Then, for x > 1, 

P{/i(T„0 > xlogm} < e^-iTO^-=''°s=^. 

The upper bounds in Theorem 1 follow immediately from Theorem 3. 

Proof of Theorem 1, upper bounds. For any i > 0, by a union bound we have, 
for n > 2, 

max \Pij I > a* \ogn + t> < nP < max \Pij \ > a* log n + t 

l<i,j<n \ \l<j<n 

The event in the right-hand side above holds if SPTi has height at least a* logn-|- 
t. Since SPTi is distributed like a random recursive tree on n nodes, to further 
bound the probability on the right-hand side, we may use Theorem 3 with m 
replaced by n and x ~ a* + t/ logn. In order to simplify the obtained expression, 
observe also that, by the definition of a* > e, for t > 0, 

X log X — X = ( a* + — ^ — ) log ( a* + — ] — ( a* + — ) > 1 



We obtain that 



log nj \ log nj \ log n J log n 



X x-x\ogx ^ a*-(-t/logn -l+t/logn 



P <^ max |Pij| > a* logn + t ^ < e^n^-^''°s^ < e 

[_l<j"<n J 

and therefore 

P I max |P„| > a*logn + 4 < e"*+*/i°s"e-* . (2) 
[i<»,i<" J 

Choosing t ~ t{n) oo proves the upper bound in probability. Now, using the 
bound (2), we have, for n > 8, 

Mogn-hl-f V2''p( max \P,j\ > a* logn + 2'^-^\ 
jr[ ti<tj<« J 



E 



max \P,j\ 

l<i,j<n 



< a 



< a*logn+l + e"* •^2^cxp(-2^-2) 



£=1 



= a*logn-f 0(1), 

proving the upper bound in expectation. □ 



3 Towards the lower bound 

Fix a path P = z'l, . . . , Wfc+i, fc < 12 logn (where 12 is somewhat arbitrary, the 
key point being that it's bigger than 3a*). It is easily seen that for w > 0, 
letting Po(z«) denote a Poisson mean w random variable, we have 

P {w{P) < wn} = P {Po(uj) >k} — — ■ 

i=k 
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When w = o(l), the above sum is dommated by its first term and e ™ = 1 — o(l), 
so we have 

P{wiP)<wn}^{l + oil)) — . (3) 

We next show that given that P has smaU weight, it is very hkely to be the 
minimum-weight path between its endpoints. More precisely, let P*{v,w) be 
the minimum- weight path in /v„ between two vertices v and w. Then we have 
the following. 

Lemma 4. Fix c > 2. For all n sufficiently large, for all e > c log log n/ log n 
and any path P = vi, . . . , Vk+i in Kn with 1 < k < 12 log we have 

P{P ^ P*{vuvk+i) \w{P)<{l~e)logn}<^. 

The restriction c > 2 is not necessary in the above lemma, but the upper 
bound becomes trivial when e < 2 log log n/ log n. 

Proof. Let SPTi,(t) denote the shortest path tree started from a vertex v and 
stopped at time t. For fc > 2, if P 7^ P*{vi,Vk+i) then it must be the case 
that for some i = 1, . . . , A:, in Kn \ EJ{P), SPT^. ((1 — e) logn) contains one of 

Vi+l, ■ ■ ■,Vk+l. 

Allowing connections using the edges along the path P only increases the 
probabihty of this event. Let be Kn where the edge weights Xe, e £ E{P), 
along the path P have been replaced by independent copies. Let Ai be the 
event that, in K^ , SPT^,. ((1 — e) logn) contains one of Vi+i, . . . , Vk+i for some 
i = 1, . . . ,k. The remark above then implies by the union bound that 

k 

P{P^P*{vi,Vk+i) \w{P)<{l-e)\ogn}<J2P{M- (4) 

i=l 

Let Ni denote the number of nodes in SFT^,; ((1 — e) logn) in K^ . Observe 
that, conditioning on Ni, the probability that Ai does not occur is 



1-F{A,\N,} = n 1 —]\-l[n-N,>k-i + l] 




> "lax 1- ^ ^ J ' ,0 , (5) 



where 1 [ • ] denotes the indicator function. Let Ei (0) be the event that Ni < 
2n^-% for j = 1, . . . , [log2(n72fc)J-l, let E,{j) be the event that 2^n^-'' < N, < 
2^'+ini-% and let Ei{l\og2{n^ /2k)\) be the event that N > 2Li°S2("72fc)J 
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Expressing P {Ai} as a sum of conditional probabilities, we have 

Llog2(ri72fc)J 

P{A,} = E \E,{j)}-F{Em (6) 

Llog2(n72fc)J 

< Yl P{A^ \E,{j)}-3V2Je-'\ 
j=o 

by Lemma 2. Using (5) to bound P {Ai \ Ei{j)}, 

Liog,(n72fc)j-i (fc„^^i)2J+ini"^ 3V2L'°g2(»-/2fc)j 
i=o 



6(fc-z + l) ^ 23J/2 3V2L'°g2("V2fc)J 
- n'^(l-l/2fc) ^"^^ gSLio.sC^vs-.)] 

< l^fcl±ll + .-, (7) 

where we have used the fact that X]j>o 2'^-'/^e~^^ < 1 and the last inequality 
holds for all k < 121ogri, and n sufficiently large. It follows that, since fc > 1, 



EP{AJ<- 12 ^ )+k)< 



U2 



2 

which, together with (4), completes the proof. □ 

The following fact follows immediately from the proof of Lemma 4 (there 
are just fewer terms in the sum (6)); it will be useful later. 

Corollary 5. Fix c > 2. For all n sufficiently large, for all e > clog log n/ logn 
and any path P — vi, . . . , Vk+i in Kn with 1 < fc < 12 logn, we have 

13fc2 

P{P^P*ivi,Vk+i) I (l-2e)logn< w(P) < (l-e)logn} < -—. 

It follows from Lemma 4, at least intuitively, that to get a lower bound of fc 
on the length of the longest minimum-weight path, we can instead bound from 
below the probability that there is some path with k edges of weight (1 — e) logn. 
Given a positive integer fc and real e > 0, let 'Pk,e — 'Pk,e{n) be the set of paths 
with fc edges and weight at most (1 — e) logn in As a first step, we remark 
that by (3), 

Let cte be the solution of 

aloga-a(l + log(l -e)) = 1, (9) 
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so < a* and — > a* > e as e — 0. When t = k — \ogn = o(Vlogn), we 
then have 



(l + o(l)) ^eV,, 



V27raelogn \a 



Letting 



fee = log n- fi^ log log n where [3^ = — , ( 10) 

2(l + logae-log(l-e)) 

the following estimate for ElPfc^cj, for k close to fcg. 

Lemma 6. Let a^, f3e and k^ be defined as in (9) and (10) above. Given e with 
< e < 1/2, there is a positive constant Ce such that E\Vkt.e\ = (1 + o(l))ce 
and, for \t\ — o{y/logn), 



t 

e 



In particular, it follows that if < = t{n) — oo then E|7'fc^+t.c(n)| — > oo. To 
derive a lower bound on the probability that there exists such a path, we use the 
second moment method, and now introduce the version of it we require. Given 
any random set S of paths in Kn and two paths P and Q, let 95 (P, Q) be the 
probability that P and Q are both in S, and let 

A(5)=5]g5(P,Q), (11) 

P.Q 

where the sum is over pairs P, Q of distinct but intersecting paths in Kn- By 
Corollary 4.3.4 of Alon et al. [1], we then have 

Given the preceding discussion, a natural choice for the set S would be 'Pk,+t,e, 
for some t = t{n) tending to —00 with n. Unfortunately, for this choice of 5 and 
for the values of t we wish to consider, the quantity A(5) is too large for (12) 
to yield a useful bound. However, it is both useful and instructive to proceed 
as though this was our choice of iS, and see how far we can get. 
Given paths P and Q in Kn, let 

q,{P,Q)^V{w{P) < (l-e)logn,u;(g) < (l-e)log7i}. 

Also, for i e M, let 

At = At{n,e)=Y,q,{P,Q) (13) 

where the sum is over pairs P, Q of distinct but intersecting paths with [fc^ + 
edges in Kn- (So, Aj is just A(5) when S is the set of paths in Vfkt+t'],<L-) 
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3.1 Light intersecting paths 



In order to bound At, we first decompose the sum in (13). For a given real 
number t and integers i,j with 1 < j < « < [fc^ + <] , let At^j be the sum of 
qe{P, Q) over paths P, Q with [fc^ + 1] edges, such that P and Q share i edges, 
and these i edges form precisely j connected components. Then, we have 

At^ At,,,. (14) 

l<j<i< \kc+t] 

We now need to consider (a) the probability that two paths of k edges (for 
k near to k^) sharing i edges both have weight at most (1 — e)logn, and (b) 
the number of such pairs of paths. The following lemma bounds the former 
probability. Counting the number of terms of a given sum At,i.j is the subject 
of Section 3.2. 

Lemma 7. Given two paths P,Q in Kn, each consisting of k edges, i of which 
are common to P and Q, and any s > 0, we have, 



P {w{P) < sn, w{Q) < sn} < 4^" 



{2k 



Proof. To make the formulas easier to read, introduce w' {P) — w{P)/n and 
w'{Q) ~ w{Q)/n. This corresponds to the case of exponential edge weights 
with mean 1 instead of the exponential mean n for the individual edge weights. 
Thus, we need to evaluate P{w'(P) < s,w'{Q) < s}. Observe that the sum of 
i exponential random variables is has a Gamma(l,^) distribution, with density 
function /^(t) = t^~^e~*/{£— 1)! and distribution function Fg{t). So, condition- 
ing on the aggregated weight of the i shared edges, we see that 



P{w'iP)<s,w'iQ)<s}^ f f,{t)Fk-,{tf 

Jo 



dt. 



However, for t > 0, fe{t) < t^-^ l{l - 1)! and Fi{t) < t^/t\, which implies that 
V{w\P)<s,w'{Q)<s} < ,. ^ ' dt^ I ^ ^ 



[i-iy. (fc-i)!2 (2k -iy. \ k-i 

Using the classical bound for the central binomial coefficients (^^) < 4" com 
pletes the proof. □ 



3.2 The number of intersecting pairs 

What is actually needed is to count the number of pairs (P, Q), where P and 
Q are two paths containing k edges such that P DQ has i edges in j connected 
components. More precisely, PnQ is a graph composed of j disjoint paths, with 
i edges in total. Our aim is to analyze shortest paths in the graph, and hence 
it suffices to consider self-avoiding paths (that do not intersect themselves even 
at vertices). Let Nksj denote the number of such pairs. 
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Lemma 8. The number of pairs of self-avoiding paths {P, Q) of length k such 
that P n Q contains i edges in j connected components satisfies 

Proof. Observe first that if fc < i + 2j — 2 then N^^ij = 0, since for a given path 
P, if Q shares i edges in j connected components with P then Q has at least 
z + 2 j — 2 edges. We now assume that k > i + 2j — 2. 

We first focus on the choice of P with the edges of P n Q distinguished. 
First fix a path P, self- avoiding, as an ordered sequence of fc + 1 vertices. There 
are {fj^i) • {k + 1)! such choices. (In fact, we are double counting here, as the 
reversed sequence would yield the same path P; we can afford to ignore this fact 
in obtaining our upper bound.) We next choose the sequence of sizes of the j 
parts of PnQ as they appear along P. Since POQ contains i edges, it suffices to 
split i into j ordered parts; there are ('^"') possibilities for this partition. Now 
that we have the sequence of sizes of the parts of P n Q, it remains to choose 
the positions in P where these edges appear. We can count the number of such 
choices as follows: once we have removed PnQ, there remains a sequence of 
k — i edges of P. The portions of P n Q can be inserted at any of the k — i + 1 
separating positions (the extremities are included), and hence there are (''"j^^) 
choices. So in the end, fixing the path P together with the edges of P n Q can 
be done in 

distinct ways. 

It now remains to choose the second path Q, also self-avoiding, so that it 
intersects P at the distinguished edges. We first choose the order in which the j 
parts of PnQ appear in Q: there are j\ possible choices. The intersection PnQ 
contains i +j vertices and we need k + l — i+j other vertices to complete Q. It is 
possible that some vertices of Q\{PnQ) are in P, so we can choose an ordered 
sequence of these vertices in (j,"7-7-j) + 1 ^ * ^ j)! different ways. (In fact, 
not all such sequences yield valid choices for Q\P. For example, Q\P should 
not contain two consecutive vertices from P in the same part, or Q n P will not 
be what we claimed. However, we only seek an upper bound, and so can afford 
to ignore this issue.) Finally, we need to choose how the j (now ordered) parts 
of P n Q interlace with these (fc + 1 — ?' — j) extra vertices. The extra vertices 
define k — i — j + 1 intervals (with the extremities): there are C'~^~''~^^) ways 
to choose j of them (and insert the parts of P n Q at these spots) . It follows by 
this argument that for a fixed path P with distinguished edges forming j parts, 
there are at most 

.:■(,:-:!>—-..:-('--■-) M 

possible choices for the path Q. 

Now, the desired number N^aj of pairs (P, Q) of paths of length k such 
that PnQ contains i edges in j connected parts is at most the product of 
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the numbers appearing in (15) and (16). The desired bound follows by routine 
bounding using the inequality (^) < □ 

3.3 Paths intersecting at least twice 

Lemmas 7 and 8 yield more than sufhcicnt control over all l^t,i,j with j > 2, 
which we quantify in the following lemma. 

Lemma 9. There is an eo > such that for all < e < eo, if k = logn + 
o{\/logn), then for all n sufficiently large 

Y.2<j<i<k ^ ^-0.95 

Proof. Combining the bound on the number of intersecting paths (Lemma 8) 
with Lemma 7 with s = (1 — e) log n/n, we obtain 

At,ij := iVfc,,j • P {w{P) < sn, w{Q) < sn} 

< n2-J(2/c3)J((l_e)logn)2'^-' 



(2k -iy. 



< „2-,(2fc3).Y^i^^l£l^V''^4^-^ 
\ 2k ~ i J 

since, for £ > 1, we have i\ > (i/eY. We have assumed that j > 2, so for n large 
enough, {2k^y-n?-^ < 4^^. It follows that 



.6".,. 2<^<fc V (2fc-z) ; 



2k-i 



2<j<i<k 

Using the estimate (8) for E|Pfc_£|, it follows that, for n large enough, 

^-^^^'<-t^-- < I^max|fl^ii^ili^^yY^Vl-(17) 
E[|7'fc,,|] ~ 2<i<fc|V 2fc-i J \2k-iJ 

We bound the right-hand side of (17) by allowing i to take real values. We write 
k = 7 logn and a; = /31ogn (so < /3 < 7); we then see that 



4e(l — e) log 77, \ / 2k 



2/c X / \ 



2k 



^a{l3) logn 



where 

' 27-/3 \ ^ , / 27 



g{p) :=/31og ^ ' \ +27 log 



,4e(l-e)y ' ''V27-/3, 
For 7 close to a* , the function g is suitably approximated by g* defined by 

5^(/3):=/31og(^j+2aMog(^^-^ 
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By differentiation, we see that g*{j3) is maximized on [0, a*] by taking /3 = 
2a* — 4, at which point ~ 1.02. For any 5, we may ensure that I7 — a*| < 5 

for n large by choosing e sufficiently small. Since g{(5) = g^{(i) is bounded and 
continuous in both 7 and (3 away from (3 G [27,cx3), for all e sufficiently small 
and all n sufficiently large we have 

l3(/3)-g*(/3)| <0.01 



for all k = logn + o{\/\ogn) and all (3 G [0, 7]. It follows that g{(3) < 1.04 for 
all (3 £ [0,7]; which, combined with (17), yields that 

S2<j<z<fc ^t,i,j ^ Ink^ _ I log^ n 
This proves the lemma. □ 



4 Dealing with paths intersecting only once 

Unfortunately, for paths intersecting only once (i.e., when j = 1), for some 
values of i the quantity At ^ 1 is too large for us to apply the straightforward 
approach used above. (Note that this is not an artefact of our upper bounds: 
Lemmas 7 and 8 are essentially tight up to logarithmic factors. Also, two short- 
est paths between two pairs of vertices typically have one connected component 
in common, so this class of paths is the main problem.) The most natural and 
naive way to deal with this complication is to simply throw away all pairs of 
shortest paths P and Q whose intersection is connected, and try to bound the 
probability that one of the remaining minimum-weight paths is "long" (in the 
same sense as above). This is essentially our approach. However, in order to 
keep a handle on the conditioning imposed in doing so, it is useful to proceed 
"from the other direction": building a set of shortest paths with special proper- 
ties that guarantee that (a) no pair of such paths has an intersection which is 
connected, yet (b) the set contains a minimum- weight path with about a^logn 
edges. 

In order to describe this set, we first need to introduce a few concepts. We say 
that paths P and P' intersect once \l Pr\P' has only one connected component 
containing at least one edge. (There may be other components of P n P' which 
are isolated vertices.) We say that a path P is a local optimum if, for all paths 
P' with |P| = |P'| that intersect P once, we have w{P') > u'(P). From now 
on, let Ok denote the set of paths with k edges that are local optima, let Ok,e 
denote the set of elements of Ok with weight at most (1 — e)logn. Also, let 
V* = P(?^) = {Pij : 1 < i < J < denote the set of shortest paths in Kn- 
Note that Ok,^ ^ Vk,e- 

An instructive example. To motivate our next definitions, consider the 
unlikely (impossible) but instructive event that we find a path P = (ui , . . . , Vk+i ) , 
all of whose edges have weight exactly 1, and consider a path Q of length k in- 
tersecting P in exactly i < k consecutive edges — say Q C] P = (wi, . . . ,z;i+i), 
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for example. Let Qi (respectively Q2) be the component oiQ\P containing vi 
(respectively Wi+i). In order that 'w{Q) < k, then certainly we must have both 
w{Qi) < k — i and w{Q2) < k~i. On the other hand, one of Qi and Q2 has at 
least {k — i) /2 edges. It follows that if w{Q) < k then in cither SPT^jj (fc — i) 
or SPT^^^j {k — i) has height at least {k — i)/2. 

This observation is key to our approach. Roughly speaking, we wish to 
consider long minimum-weight paths with the special property that none of the 
shortest path trees leaving them are "too tall". This property will guarantee 
that the paths are local optima (Lemma 14), and we can then use the second 
moment method to prove concentration for the number of such paths. However, 
the idea suggested by the above sketch gives a little too much away in terms of 
what "too tall" means; in particular, it is not careful enough about the interplay 
between the contributions of Qi and Q2 to the length of Q. When we formalize 
our idea, we will have to deal with this interplay to make the details work out. 



4.1 The fluctuations of conditioned partial sums 

In the above example, there was a second simplification: the edge weights have 
the extremely desirable property that for every subpath P' of P, w{P') is ex- 
actly E [ w{P') I w(P) = fc], which makes the conditions we need to impose on 
the shortest path trees leaving P very easy to state. It is, of course, too much 
to ask for paths whose edge weights arc as well behaved as in the above exam- 
ple. In general, the weights of subpaths will fluctuate considerably from their 
conditional expected value given w{P). Our tool for controlling the size of these 
fluctuations is the law of the iterated logarithm; we will use the version found in 
Rogers and Williams [11]. (These bounds are in fact stronger than we require; 
however, we did not see a substantially simpler proof of a simpler but sufficient 
result; furthermore. Corollary 12 is perhaps of interest independent of its role 
in the current work.) 

Theorem 10 ([11], Corollary 16.5). Let ¥±,¥2, . . . be independent and identi- 
cally distributed random variables with finite 
Then 

td/i- >gn - nE¥i \ 

P < limsup = 1 > = 1. 

I n^oo -^Var [yi] 2nloglogri J 

We then immediately have the following finite statement, more useful for our 
purposes. 

Corollary 11. Let ¥±,¥2, . . . be independent and identically distributed random 
variables with finite variance, and let Sn — X]j=i ^j- all 6 > there is a 
constant C > depending on 6 and the distribution of¥i (we write C{6, Yi) for 
short) such that 



P < sup 

I n>l 



^Var [¥i] 2n log log 7 



<C \ >1-S. 
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(To avoid annoying technicalities in our formulae, in Corollary 11 and here- 
after when we write log log n we mean max(loglogn, 1).) At this point, the fact 
that we are considering exponential random variables comes in very handy, as it 
allows us to apply Corollary 11 conditional upon the value of S'„. More precisely, 
we have 



Corollary 12. Suppose Yi is an exponential random variable. 
S > 0, there is C = C {5) such that for all n sufficiently large, 



Then for all 



P{yke [n], 



Sk - Sn{k/n) 



^Var[Y, I 5„]2fcloglogfc 



> 1 



(18) 



We emphasize that the probability in (18) is a random variable, measurable 
with respect to Sn] the content of the lemma is that this random variable is 
deterministically at least 1 — (5. In particular, if we let £ be the event in (18) 
(so (18) bounds P {£ \ 5„ }), then P {£} Es„ [P | 5„ } ] > 1 - (5 and more 
strongly, for any event £' which is measurable with respect to Sn, 

P{£\ £'} = EsJP{£ \Sn}\ £']>l-S. 

Proof of Corollary 12. Fix 5 > and n. For i = 1, . . . , n, let E'i = 5j/5„. Then 
the set {El, . . . ,En} is independent of Sn and distributed like n independent 
[0, l]-uniform random variables (for a proof, see Chapter 8 of [12]). It follows in 
particular that Ei, the minimum of n independent uniforms, has a Bcta(l, ?? — 1) 
distribution and Var [Yi \ Sn] = {Sn/n)'^{n-l)/{n+l) > {Sn/nf/i, for n > 2. 
Using this estimate of the conditional variance, (18) may be bounded as 




Sk " Sn{k/n) 



VVar[ri I 5„]2fcloglogfc 
k 



< c 



Sn 



Sr. 



y/2k\og logfc 

"It ~ ^ 




V2fcloglogfc 



where we used the independence of the Sk/Sn of Sn- Thus, proving (18) reduces 
to finding C" such that the unconditioned statement 



P < sup 

.l<fc<r 



{n/Sn)Sk - k 



\/2k\og logfc 



<C'\>1 



(19) 



holds. By choosing C large enough we can certainly ensure that this holds for 
small n; shortly wc will take advantage of this fact. By Corollary 11 we have 
that 



P < sup 

l<fc<n 



Sk 



^/2fclog logfc 



< Ci ^ > 1 



(20) 
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where Ci = C((5/2,Yi) as in the statement of Corollary 11. If, on the other 
hand, there is fc S {1, . . . , 71} such that 



{n/Sn)Sk - k 



y/2k\og logfc 



for some c > 0, then either the event whose probability is bounded in (20) must 
fail to hold, or by the triangle inequality we must have 



{n/S„ - l)Sk 



\/2k log log k 



> c, 



for some k, or more simply \Sn — n\ > C\J2k log log k ■ Sn/Sk- For this to occur, 
we must either have |5'„ — n\ > yjcn or Sk > y/c{2klog logfc) • {^/n — y/c). 
It follows that, for n large enough that y/n — ^/c > \/n/2 (and recalling that 
log log fc > 1 for all k by convention) , we have 



sup 

l<A:<n 



{n/Sn - l)Sk 



\J2k log log k 



> c 



< ^P{^fe>fc\/^}+P{|^„-n|>V^} 



fc=l 



fe=i 



(21) 



where we have bounded the first term using a large deviations bound for sums 
of exponential random variables [2] with H{x) := x — 1 — logx, and the second 
term by Chebyshev's inequality. We now choose c large enough that the right- 
hand side of (21) above is at most 5/2 for all n. Combining (21) with (20) and 
taking C" = Ci + c (or slightly larger if necessary, to deal with small n) then 
establishes (19) and completes the proof. □ 

Motivated by Corollary 12, we say that a path P = (ui, . . . , Vk+i) in Kn is 
C -legal (for a given C > 0), and write P E Cc = l^c{n), if for all i = 1, . . . , /c. 



< Cyj2l log log 7, 



and 



k 



w{P) 



< CV2iToglog7. 



(22) 



(23) 



j=k-i+l 

We need both conditions as we wish to ensure the path is well-behaved "from 
both ends". Though these conditions only restrict the weight of subpaths of P 
starting from an end, they can be combined to control the behavior of other 
subpaths. More precisely, let 



m(i, fc) 



— l,fc + l — i} and s(i, j, fc) = max{m(i, fc), to(j, fc)}. (24) 



Intuitively, s{i,j, fc) is the furthest distance from either i or j to one of the ends 
of the path. We then have 
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Lemma 13. If P = {vi, . . . , Vk+i) is C -legal then for alll<i<j<k + l, 



k 



w{P) 



< 2C ■ V2i(*Xfc)l^gT^gR«Xfc))- (25) 



Proof. Write P{i) (respectively P(i,j)) for the path (vi, . . . ,Vi) (respectively 
{vi, . . . ,Vj)), then apply (22) and (23) to P{i) or P\ P{i) according as i < 
(A; + l)/2 or i > (fc + l)/2 and to P{j) or P \ P{i). similarly. Finally, use one 
of the equalities 

p{i.]) = p{])\p{i-i) ^ {p\p{i-i))\{p\p{])) = p\{p{i-i)yj{p\pm), 

together with the triangle inequality to prove the result. □ 

4.2 Bounding the heights of shortest path trees 

We have now introduced all the technical apparatus we will need to control the 
path P itself. Returning to our example from page 12, we recall that our main 
goal is to bound the heights of the shortest path trees leaving P; Lemma 13 tells 
us just how tall we can allow them to be. We say that P is C-bonsai if: 

{■k) for alH = 1, . . . , A: + 1, and all integers i with £ > m{i, k)/AO, in 

SPTy U + 2CV500£loglog€) has height less than f , ) • £. 

\10w{P) J 

We denote the set of C-bonsai paths in Kn by Be = Bc{n). 

Lemma 14. For any C > 0, if w{P) < k < 4iy(P) and P is both C -legal and 
C-bonsai, i.e., P € Cc H Be, then P is a local optimum. 

Proof of Lemma I4. Let P = (ui, . . . , Vk+i) G Cc H Be- Our aim is to show 
that for any integers i,j with l<i<j<k-\-l and any path Q of length k 
whose intersection with P is {vi, . . . , Vj), we have 'w{Q) > ■w{P). By symmetry 
we may assume that i < (fc + 1)/2 (so m.(i, k) = i — 1) and that m{i, k) < m{j, k) 
(so s{i,j,k) = m{j,k)). 

Let Qi (respectively Qj) be the component of Q \ P containing Vi (respec- 
tively Vj). We must have \Qi\ + \Qj\ = k — {j ~ i) > m{i,k) -\- m{j,k). Let 
h = (10/9)(u;(P)/fc)|g,| and define kj similarly. If \Qi\ > km{i,k)/{40w{P)) 
then since P is C-bonsai and w{P)/k > 1/4, 



w{Qi) > h + 2Cy/500ki log log h > h + 2C^Jm{i, k) log logm(i, k). 

(The constant 500 ensures that the second inequality holds.) Similarly, if \Qj\ > 
{m(j,k)/40)k/w{P), then 



w{Qj) > kj + 2C^500fcjlog logfcj > fcj + 2C^m{j, k) log log m(j, k). 
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If both these conditions occur then by the definitions of ki and kj we have 
w{P) 



MQ^) + w{Q,) > + \QA) + 2C^sii,j,k)log\ogsii,j,k). 

Since w(P) < /c, combining the preceding equation with the lower bound on 
'w{P n Q) given by Lemma 13 completes the proof in this case. 

On the other hand, if \Qj\ < km{j,k)/{AOw{P)) < m(j, fc)/10 then \Q,\ > 
m{i,k) and 

1^1^., .^^ k) 9{k - (j - i)) 9m(j, k) 

m>ik-ij~^))~^^> — ro — 

Since P is C-bonsai we have w{Qi) > ki + 2C-\/500fc,; log log ki. Moreover, in this 
case, ki > {w{P)/k)[k — {j — i)) and ki > m{j, fc)/4 = s(i, j, fc)/4, the preceding 
equation implies that 

w{P) 



w{Qr) > -^—{k - U - i)) + '2Cy/s{i,j,k)log\ogs{i,j,k), 

which, combined with Lemma 13 and the fact that w{P) < k, completes the 
proof in this case. The case \Qi\ < {m{i, k)/AO){k/w{P)) is similar to but easier 
than the case \Qj\ < {m{i,k)/40){k/w{P)). □ 

We are now in a position to fully define the special set of paths we wish to 
consider. Given e > and a positive integer k, let lA^^k,c ~ i^e.k.cin) be the set 
of paths P of length k in /i„ for which the following events hold 



A2 

A3 



{(1 - 2e) logn < w{P) < (1 - e) logn}, 

{P G V*} (26) 

{PeCcn Be}. 



We remark that if e > is small enough (in particular, any e = o(l) suffices), 
C > is fixed, t = t{n) — o{^\ogn), and P £ l^e.\kt+i],c (where fc^ was defined 
in (10)) then for n large enough, w{P) < k < Aw{P), so, by Lemma 14, P is a 
local optimum. The following lemma bounds the expected size oiUe^k,c- 

Lemma 15. There exist C > and S > such that for any fixed c > 2 and 
any e with S > e> c log log 71/ log n, for any function t{n) = o(^/logn), and for 
all n sufficiently large 

Proof. Recall that h{T) denotes the (unweighted) height of a rooted tree T. Fix 
e > as above, and a path P of length k = \ke +t']. It sufhces to prove that 
for any such path P, P {P € l^e.lk,+t],c} > "^P {P ^ ^rfc,+t],c} for a suitable 
6. Define the following conditional probability 

Pe{ • }:=P{ • I (l-2e)logn <u'(P) < (l-e)logn}. 
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Then, by the definition (26), we have 

P{Peu,,lk,+t^.,c} = P{Ai,A2,As} 

= Pe {A2, A3} •?{(!- 2e) logn < w{P) < (1 - e) logn} 
= P, {A2, A3} ■ (1 + o(l))P {w{P) <{l-e) logn} 
(by (3) 

= P,{A2,^3}-(l + o(l))P{Pe7'rfc.+ti,.} . 

So to prove the lemma it suffices to show that P^ {^2,^3} is bounded away 
from zero. To do so we first write 

P,{A2,^3} = P.{^e7'*,Pe/:c,PeSc} 

> P,{PeBc,PeCc}-Pe{P^V*}- 

For e as above, P^ {P ^ V*} ^ as n ^ cx) by Corollary 5, so we need only 
prove that P^ {P G Be, P G Cc} is bounded away from zero. To do so, rather 
than considering the set Be we consider a slightly different collection, which 
we denote B^- We let P G Bq if (★) page 15 holds with the term w{P) re- 
placed by (1 — e) log 71. Given that w{P) < (1 — e)logn, we have B^ C Be, 
so P, {P eBc,P& Cc} > Pe{P& B'c, P e Cc}- But the event P e is 
independent from the events P e Cc and 

Ai = {(1 - 2e)logn < w{P) < (1 - e)logn}, 

since they are determined by disjoint sets of edges. Thus, we have 

P,{P e Bc,P e Cc} >P{P e Bh}Pe{P e Cc} ■ 

Now choose C = C"(l/2), where C"(l/2) is as in Corollary 12. By that corollary 
we then have P^ {P G Cc} > 1/2, so 

Pe {P e Be, P e Cc} > e ^c) • 

To complete the proof, it thus suffices to prove that P{P e B^.} is bounded 
below by a constant. 

We assume e is small enough and n is large enough (say n > hq for some ??o) 
that 

(l-^)l°g" < ± (27) 

\h+t^ -3.5- ^ 

(Note that this is possible by the value of a*. We also remark that the preceding 
equation is the only place we use the hypothesis that e is "small".) To further 
simplify our lives, we now consider a subset of Bq that we denote Hi^i ^c,i, 
where Bc,i is the set of all paths Q = (wi, . . . ,Vk+i) in Kn for which, for all 
integers £ with i > m,{i, fc)/40 (recall that m{i, k) is defined in (24)), in 

h (SPT„. (e + 2CV500£loglog£) ) < 3.15£. (28) 
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The set Dii^i Bc,i is indeed a subset of Bq, since 9fc/(10(l — e)log7i) > 3.15 
by (27). The events P € Bc,i, i = I, . . . ,k + 1 are all increasing in the weight 
values, so by the FKG inequality [1, 4, 5, 6], we have 

k+l l(k+l)/2\ 

P{PeBh}>Y[P{PeBc,,}> n P{P(^l3cA\ (29) 

where the second inequahty takes advantage of the symmetry. For small £, the 
probabihty that (28) occurs is bounded away from zero: it is at least the prob- 
ability that all n — 1 edges leaving Vi have weight at least £ + 2C V500£ log log i, 
and every edge weight has an exponential distribution with mean n. Thus, 
choose an integer £o large enough that the following four conditions are satisfied 
for e>io, 



2CV500^1oglog£ < 1.05£, / e'^"^/« < e-^/174 



1 _ > cxp (-e-^/io) \ 3e^/40exp(-e^/20) < e-^/io/4 

Then there is ao > such that for £ < io, the event in (28) occurs with 
probability at least ao. Furthermore, for each £ the event in (28) is increasing 
in the weight values, so the FKG inequality yields 

00 

P{PeBc,^} > n P{/i(sPT, (£ + 2CV500^1oglog£)) < B.lS^j 

£=[m(i,fc)/40] 

> 4' • Yi P{/i(SPTj (1.05£)) < 3.15^}, (30) 

£>£oVm(i,fc)/40 

where we have written = max{£o ~ m{i, fc)/40, 0}. We now turn to bounding 
these remaining probabilities for £ > £a. By Lemma 2, and our choice for £0, 



P {|SPT, (1.05£) I > el "} < 3e^/«exp (-e^/« 



< . 



Furthermore, with the following values and our choice for £q, the bound of 
Theorem 3 yields 



P -e/10 

P {h (SPT, (1.05^)) > 3.15^ I ISPTi (1.05^) | < e^ " } < 



It follows from the two preceding equations that for £ > £0 

P{/i(SPT, (1.05£)) < 3.15£} > h >l--^>e-'= 

Combining this inequality with (30) yields 

P {P G Be A > 4' exp I - I ^ "0" exp (-26e~"('^'=)/4ooj . 

\ £=[m(i,fc)/40] / 
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Observe that the multiplicative factor q;q' = 1 but for the values of i such that 
m{i, k) < 40^0- Observe also that for i < {k + l)/2, we have by definition (24) 
m{i, k) — i — 1. So, together with (29) the preceding equation yields 



L(fc+i)/2j 

P {P e Bh} > exp I -52 V e-^'-i)/^" 



1=1 

which completes the proof, since the series in the right-hand side converges. □ 

With Lemma 15 under our belt, we can now complete our proof of Theorem 1 

Proof of Theorem 1, lower bounds. Let C be as in Lemma 15, choose e = e{n) = 
3 log log 77,/ log n, and choose a function t — t{n) = o{^\ogn) such that t{n) 
—oo with n; we will make a more precise choice for t(n) shortly. As noted just 
before the statement of Lemma 15, if e = o(l), C > is fixed, t = o{^/\ogn), 
and P G U^jk,+t^,c then for n large enough, w{P) < k < 4w(P), so, by Lemma 
14, P is a local optimum. Our choices of e, C, and t satisfy these conditions, so 
for n large enough we have 

By Lemmas 6 and 15, and the definition of k^ in (10), there is K > such that 
choosing t(n) = /Clog log n, for n large enough we have 

E|Z^rfe,+t],e| > logn. 

Recalling the definition (11) of A(Z^^fe^_|.f^ _j), we remark that since U^k^+t].e.c ^ 
(^[fce+t],ei the contribution of pairs of paths who intersect once to AiU^f^^^t-^^^), 
is zero. Also, since W[fe^+t],£.c ^ "^[fcj+tl.e j we have 

2<j<i<[fej+t] 

so by Lemma 9, for n large enough, we have A(U^k^+t],e) £ n~'''^^E [jPi-^^^+j^ gj] ^ . 
By our choice of e there is K' > such that k^ + t > a* \ogn — /T' log log n. It 
follows by the second moment method (12) and Lemma 15 that 

P ( max |Py I < a* log n - A" log log nl < P { \U^k.+t^ J ^ O} 

1 , A{U^„^+t^,e) 



< 



1 



^ 1 + ^2 

iogn 0^ ji^oo 
proving the lower bound in probability. We then have from the previous bound 



E 



max |Ph| 



> {a* log n~ K' log log n) [ 1 



log n 

which proves the lower bound in expectation. □ 
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5 Conclusion 



Wc have shown that the longest minimum-weight path in a complete graph 
with i.i.d. exponentially distributed edge weights is asymptotically Q!*logn in 
probability, where a* « 3.5911 satisfies a* log a* = a* + 1. The difficult part of 
the proof is the lower bound as the upper bound follows easily from Devroye's 
[3] results on the height of a random recursive tree and a simple union bound. 
The fact that such a simple union bound leads to correct bounds means that 
the n shortest path trees SPT^ essentially behave as if they were independent. 

The lower bound is proved by a second-moment argument for a carefully 
chosen random set of minimum-weight paths, all of length about a* logn, that 
satisfy certain regularity conditions. This careful choice allows us to control the 
variance of the size of the set of such paths. The proof shows that, with high 
probability, there exist minimum- weight paths of length about a* log n whose 
total weight is close to logn. Thus, these extra-long paths have a total weight 
just like a typical minimum-weight path. 

Finally, just as Janson [8] did in proving his results discussed in the in- 
troduction, we can use a standard coupling argument to extend Theorem 1 to 
distributions other than exponential. To explain the coupling, it is useful to di- 
vide the edge weights by n, so to consider the Exp(l) edge weights X'^ = Xe/n. 

Let Z be any non-negative random variable, with distribution function F(t) = 
P {Z < t}, and let F~^{t) = supja; : F{x) < t}. Let U he a random variable 
uniform on [0, 1]. Then F~^{U) is distributed as Z. We may thus couple the ex- 
ponential edge weights X'^ to uniform edge weights Ue by setting X'^ = i^^^(C/e), 
where Fsit) is the distribution function of an Exp(l) random variable. We have 
Fsit) = 1 - e"* = t - + 0{t^) ast 0, and it follows that for all edges e 
of K^, \X'^ - Ue\ = 0((X^)2), uniformly when X'^ < 1/2. 

Of particular use for us is the following consequence: for all edges e with 

< 121ogn/n (say), we have \X'^ — Ue\ = 0(log^ n/n^), so for any path P 
with O(logn) edges and with w'{P) := J2eeE{P) -^'e — 121ogn/ri, we have 



It then follows easily that the conclusions of Theorem 1 hold for uniform [0, 1], as 
well as exponential, edge weights. (Technically, one must redo all the arguments 
of the paper keeping (31) in mind, and accordingly adjust some of the constants 
very slightly; we omit this step.) 

In fact all we really need is a bound of o(l/n) in (31) in order to extend 
Theorem 1 to uniform [0, 1] edge weights. Thus, by another coupling as above, 
we may extend Theorem 1 from uniforms to any other edge weights Y whose 
distribution function Fy satisfies Fyit) ~ ct + olt/log^t) for some constant 
c > 0, as t 0. 




(31) 
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